AITopics

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision (0.69)

Neural Information Processing SystemsFeb-11-2026, 05:08:41 GMT

fa78a16157fed00d7a80515818432169-Supplemental.pdf

Horovod: fast and easy distributed deep learning in tensorflow.

artificial intelligence, machine learning, prediction, (12 more...)

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.06)
North America > United States > Oregon > Multnomah County > Portland (0.05)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.05)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.56)

Neural Information Processing SystemsFeb-9-2026, 07:16:58 GMT

VideoInstanceSegmentationusing Inter-FrameCommunicationTransformers

These per-frame methods extend the concept oftemporal tracking bymatching frame-wise predictions ofhighsimilarities.

artificial intelligence, etal, transformer, (17 more...)

Technology: Information Technology > Artificial Intelligence (0.68)

Neural Information Processing SystemsFeb-9-2026, 03:46:47 GMT

5820ad65b1c27411417ae8b59433e580-Supplemental-Conference.pdf

dealignment, encoder block, temporal modeling, (12 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.50)

arXiv.org Artificial IntelligenceNov-26-2025

PeriodNet: Boosting the Potential of Attention Mechanism for Time Series Forecasting

Zhao, Bowen, Xing, Huanlai, Xiao, Zhiwen, Peng, Jincheng, Feng, Li, Wang, Xinhan, Qu, Rong, Li, Hui

PeriodNet: Boosting the Potential of Attention Mechanism for Time Series Forecasting Bowen Zhao, Huanlai Xing, Zhiwen Xiao, Jincheng Peng, Li Feng, Xinhan Wang, Rong Qu, Hui Li The proposed PeriodNet hybridizes a period attention mechanism, an iterative grouping mechanism, and a period diffuser architecture to achieve accurate multivariate time series forecasting. The period attention mechanism captures temporal similarities among adjacent periods to improve time series modeling. The period diffuser architecture leverages multi-scale period features extracted by the encoder to enhance the accuracy and efficiency of time series forecasting. Abstract The attention mechanism has demonstrated remarkable potential in sequence modeling, exemplified by its successful application in natural language processing with models such as Bidirectional Encoder Representations from Transformers (BERT) and Generative Pre-trained Transformer (GPT). Despite these advancements, its utilization in time series forecasting (TSF) has yet to meet expectations. Exploring a better network structure for attention in TSF holds immense significance across various domains. In this paper, we present PeriodNet with a brand new structure to forecast univari-ate and multivariate time series. PeriodNet incorporates period attention and sparse period attention mechanism for analyzing adjacent periods. It enhances the mining of local characteristics, periodic patterns, and global dependencies. For efficient cross-variable modeling, we introduce an iterative grouping mechanism which can directly reduce the cross-variable redundancy. To fully leverage the extracted features on the encoder side, we redesign the entire architecture of the vanilla Transformer and propose a period diffuser for precise multi-period prediction. Through comprehensive experiments conducted on eight datasets, we demonstrate that PeriodNet outperforms six state-of-the-art models in both univariate and multivariate TSF scenarios in terms of mean square error and mean absolute error.

data mining, large language model, machine learning, (18 more...)

2511.19497

Country:

Asia > China (0.46)
Europe > Austria (0.28)

Genre: Research Report (1.00)

Industry: Health & Medicine (0.94)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Gerçek, Alinda Ezgi, Korten, Till, Chekhonin, Paul, Hassan, Maleeha, Steinbach, Peter

Data-efficient U-Net for Segmentation of Carbide Microstructures in SEM Images of Steel Alloys

arXiv.org Artificial IntelligenceNov-17-2025

Understanding reactor-pressure-vessel steel microstructure is crucial for predicting mechanical properties, as carbide precipitates both strengthen the alloy and can initiate cracks. In scanning electron microscopy images, gray-value overlap between carbides and matrix makes simple thresholding ineffective. We present a data-efficient segmentation pipeline using a lightweight U-Net (30.7~M parameters) trained on just \textbf{10 annotated scanning electron microscopy images}. Despite limited data, our model achieves a \textbf{Dice-Sørensen coefficient of 0.98}, significantly outperforming the state-of-the-art in the field of metallurgy (classical image analysis: 0.85), while reducing annotation effort by one order of magnitude compared to the state-of-the-art data efficient segmentation model. This approach enables rapid, automated carbide quantification for alloy design and generalizes to other steel types, demonstrating the potential of data-efficient deep learning in reactor-pressure-vessel steel analysis.

artificial intelligence, machine learning, rensen coefficient, (13 more...)

2511.11485

Country: Europe > Germany (0.16)

Genre: Research Report (0.65)

Industry:

Energy (0.55)
Materials > Metals & Mining > Steel (0.41)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

Maity, Soumyajit, Kamboj, Pranjal, Maity, Sneha, Singh, Rajat, Chatterjee, Sankhadeep

Google-MedGemma Based Abnormality Detection in Musculoskeletal radiographs

arXiv.org Artificial IntelligenceNov-11-2025

This paper proposes a MedGemma-based framework for automatic abnormality detection in musculoskeletal radiographs. Departing from conventional autoencoder and neural network pipelines, the proposed method leverages the MedGemma foundation model, incorporating a SigLIP-derived vision encoder pretrained on diverse medical imaging modalities. Preprocessed X-ray images are encoded into high-dimensional embeddings using the MedGemma vision backbone, which are subsequently passed through a lightweight multilayer perceptron for binary classification. Experimental assessment reveals that the MedGemma-driven classifier exhibits strong performance, exceeding conventional convolutional and autoencoder-based metrics. Additionally, the model leverages MedGemma's transfer learning capabilities, enhancing generalization and optimizing feature engineering. The integration of a modern medical foundation model not only enhances representation learning but also facilitates modular training strategies such as selective encoder block unfreezing for efficient domain adaptation. The findings suggest that MedGemma-powered classification systems can advance clinical radiograph triage by providing scalable and accurate abnormality detection, with potential for broader applications in automated medical image analysis. Keywords: Google MedGemma, MURA, Medical Image, Classification.

abnormality detection, artificial intelligence, machine learning, (16 more...)

2511.056

Country: North America > United States > Texas (0.15)

Genre: Research Report (0.70)

Industry: Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Perceptrons (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.47)

Wang, Shang, Tang, Huanrong, Ouyang, Jianquan

A Transformer-based Neural Architecture Search Method

arXiv.org Artificial IntelligenceNov-3-2025

This paper presents a neural architecture search method based on Transformer architecture, searching cross multihead attention computation ways for different number of encoder and decoder combinations. In order to search for neural network structures with better translation results, we considered perplexity as an auxiliary evaluation metric for the algorithm in addition to BLEU scores and iteratively improved each individual neural network within the population by a multi-objective genetic algorithm. Experimental results show that the neural network structures searched by the algorithm outperform all the baseline models, and that the introduction of the auxiliary evaluation metric can find better models than considering only the BLEU score as an evaluation metric.

artificial intelligence, machine learning, natural language, (18 more...)

2505.01314

Country:

Europe > Portugal (0.17)
Asia > China (0.16)
North America > United States (0.14)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.86)

Abdel-Ghani, Muraam, Ali, Mahmoud, Ali, Mohamed, Ahmed, Fatmaelzahraa, Arsalan, Muhammad, Al-Ali, Abdulaziz, Balakrishnan, Shidin

FASL-Seg: Anatomy and Tool Segmentation of Surgical Scenes

arXiv.org Artificial IntelligenceOct-31-2025

The growing popularity of robotic minimally invasive surgeries has made deep learning-based surgical training a key area of research. A thorough understanding of the surgical scene components is crucial, which semantic segmentation models can help achieve. However, most existing work focuses on surgical tools and overlooks anatomical objects. Additionally, current state-of-the-art (SOT A) models struggle to balance capturing high-level contextual features and low-level edge features. We propose a Feature-Adaptive Spatial Localization model (FASL-Seg), designed to capture features at multiple levels of detail through two distinct processing streams, namely a Low-Level Feature Projection (LLFP) and a High-Level Feature Projection (HLFP) stream, for varying feature resolutions - enabling precise segmentation of anatomy and surgical instruments. We evaluated FASL-Seg on surgical segmentation benchmark datasets EndoVis18 and EndoVis17 on three use cases. The FASL-Seg model achieves a mean Intersection over Union (mIoU) of 72.71% on parts and anatomy segmentation in EndoVis18, improving on SOT A by 5%. It further achieves a mIoU of 85.61% and 72.78% in EndoVis18 and EndoVis17 tool type segmentation, respectively, outperforming SOT A overall performance, with comparable per-class SOT A results in both datasets and consistent performance in various classes for anatomy and instruments, demonstrating the effectiveness of distinct processing streams for varying feature resolutions.

artificial intelligence, machine learning, segmentation, (17 more...)